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Abstract 

I propose a model for determining the hearer's at- 
tentional state which depends solely on a list of 
salient discourse entities (S-list). The ordering 
among the elements of the S-list covers also the 
function of the backward-looking center in the cen- 
tering model. The ranking criteria for the S-list 
are based on the distinction between hearer-old and 
hearer-new discourse entities and incorporate pref- 
erences for inter- and intra-sentential anaphora. The 
model is the basis for an algorithm which operates 
incrementally, word by word. 

1 Introduction 

I propose a model for determining the hearer's at- 
tentional state in understanding discourse. My pro- 
posa l is inspired by the centering model G rosz 



et al. (|1983 ; 1995 ) and d raws on the conclusions 
of Strube & Hahn 's (1996) approach for the ranking 
of the, forward-looking center list for German. Their 
approach has been proven as the point of departure 
for a new model which is valid for English as well. 

The use of the centering transitions in Brennan 
et al.'s ( 1987] ) algorithm prevents it from being ap- 
plied incrementally (cf. Kehler (1997] )). In my ap- 
proach, I propose to replace the functions of the 
backward-looking center and the centering transi- 
tions by the order among the elements of the list of 
salient discourse entities (S-list). The S-list rank- 
ing criteria define a preference for hearer-old over 



hearer-new discourse entities ( Prince, 198 1| ) gener- 
alizing ^ttube^^^ah^'s (1996) approach. Because 
of these ranking criteria, I can account for the dif- 
ference in salience between definite NPs (mostly 
hearer-old) and indefinite NPs (mostly hearer-new). 

The S-list is not a local data structure associ- 
ated with individual utterances. The S-list rather 
describes the attentional state of the hearer at any 
given point in processing a discourse. The S-list is 
generated incrementally, word by word, and used 



immediately. Therefore, the S-list integrates in the 
simplest manner preferences for inter- and intra- 
sentential anaphora, making further specifications 
for processing complex sentences unnecessary. 

Section ^ describes the centering model as the 
relevant background for my proposal. In Section ^, 
I introduce my model, its only data structure, the 
S-list, and the accompanying algorithm. In Section 
^, I compare the results of my algorithm with the 
resul ts of the centering algorithm ( Brennan et al., 
1987) with and without specifications for complex 
sentences ( Kameyama, 1998 ). 



2 A Look Back: Centering 

The centering model describes the relation between 
the focus of attention, the choices of referring ex- 
pressions, and the perceived coherence of discourse. 
The model has been motivated with evidence from 
pref erences for the antecedents of pronouns G rosz 



etal. (1983 



1995|) and has been applied to pronoun 



resolution (Brennan et al. (1987), inter alia, whose 



interpretation differs from the original model). 

The centering model itself consists of two con- 
structs, the backward-looking center and the list 
of forward-looking centers, and a few rules and 
constraints. Each utterance Ui is assigned a list 

forward-looking centers, Cf{Ui), and a unique 
backward-looking center, Ch{Ui). A ranking im- 
posed on the elements of the C f reflects the as- 
sumption that the most highly ranked element of 
Cf{Ui) (the preferred center Cp{Ui)) is most likely 
to be the Cb{Ui+i). The most highly ranked el- 
ement of Cf{Ui) that is realized in C/j+i (i.e., is 
associated with an expression that has a valid inter- 
pretation in the underlying semantic representation) 
is the Cb{Ui-^-l). Therefore, the ranking on the Cf 
plays a crucial role in the model. [Grosz et al. (1995 ) 
and [Brennan et al. (1987 ) use grammatical relations 
to rank the C/(i.e., subj -< obj -< ...) but state that 
other factors might also play a role. 



For their centering algorithm, Brennan et al. 



(1987, henceforth BFP-algorithm) extend the notion 
of centering transition relations, which hold across 
adjacent utterances, to differentiate types of shift 
(cf. Table [I] taken from [Walker et al. (1994| )). 





Cb{U,) = Cb{U,-i) 
OR no Cb{U,-i) 


Cb(U^) / 
Cb{U,.l) 


Cb{U,) = 


CONTINUE 




Cp{Ui) 


SMOOTH-SHIFT 


Cb{Ui) ^ 
CpiUi) 


RETAIN 


ROUGH-SHIFT 



Table 1 : Transition Types 



Brennan et al. (1987| ) modify the second of two 
rules on center movement and realization which 
were defined by [Grosz et alj ( |1983| ; |1995| ): 



Rule 1: If some element of C/([/j_i) is realized as 
a pronoun in Ui, then so is Ch{Ui). 

Rule 2: Transition states are ordered. CONTINUE is 
preferred to RETAIN is preferred to SMOOTH- 
SHIFT is preferred to ROUGH-SHIFT. 



The BFP-algorithm (cf. [Walker et al. (1994| )) con- 
sists of three basic steps: 

1. Generate possible C^-C/ combinations. 

2. Filter by constraints, e.g., contra-indexing, 
sortal predicates, centering rules and con- 
straints. 

3. Rank by transition orderings. 

To illustrate this algorithm, we consider example (|l|) 
( ^rennan et al., 1987 ) which has two different final 
utterances (|I|d) and (|T]d')- Utterance (|T]d) contains 
one pronoun, utterance ( |d') two pronouns. We look 
at the interpretation of (Iji) and ([T]d'). After step 2, 
the algorithm has produced two readings for each 
variant which are rated by the corresponding tran- 
sitions in step 3. In (jljd), the pronoun "she" is 
resolved to "her" (= Brennan) because the CON- 
TINUE transition is ranked higher than SMOOTH- 
SHIFT in the second reading. In (|]d'), the pronoun 
"she" is resolved to "Friedman" because SMOOTH- 
SHIFT is preferred over ROUGH-SHIFT. 

(1) a. Brennan drives an Alfa Romeo. 

b. She drives too fast. 

c. Friedman races her on weekends. 

d. She goes to Laguna Seca. 
d.' She often beats her. 



3 An Alternative to Centering 

3.1 The Model 

The realization and the structure of my model de- 
parts significantly from the centering model: 

• The model consists of one construct with one 
operation: the list of salient discourse entities 
(S-list) with an insertion operation. 

• The S-list describes the attentional state of the 
hearer at any given point in processing a dis- 
course. 

• The S-list contains some (not necessarily all) 
discourse entities which are realized in the cur- 
rent and the previous utterance. 

• The elements of the S-list are ranked according 
to their information status. The order among 
the elements provides directly the preference 
for the interpretation of anaphoric expressions. 

In contrast to the centering model, my model does 
not need a construct which looks back; it does not 
need transitions and transition ranking criteria. In- 
stead of using the Cb to account for local coherence, 
in my model this is achieved by comparing the first 
element of the S-list with the preceding state. 

3.2 S-List Ranking 

[Strube & Hahn (1996 ) rank the C/ according to the 
information status of discourse entities. I here gen- 
eralize these ranking criteria by redefining them in 



Prince[ 's ( |1981[ ; [1992[ ) terms. I distinguish between 
three different sets of expressions, hearer-old dis- 
course entities (OLD), mediated discourse entities 
(MED), and hearer-new discourse entities (NEW). 
These sets consist of the elements of Prince's fa- 
miliarity scale ( Prince, 1981 , p.245). OLD con- 



sists of evoked (E) and unused (U) discourse entities 
while NEW consists of brand-new (BN) discourse 
entities. MED consists of inferrables (I), con- 
taining inferrables (I*-") and anchored brand-new 
(BN"^) discourse entities. These discourse entities 
are discourse-new but mediated by some hearer-old 
discourse entity (cf. Figure [l]). I do not assume any 
difference between the elements of each set with re- 
spect to their information status. E.g., evoked and 
unused discourse entities have the same information 
status because both belong to OLD. 

For an operationalization of Prince's terms, I stip- 
ulate that evoked discourse entitites are co-referring 
expressions (pronominal and nominal anaphora, 
previously mentioned proper names, relative pro- 
nouns, appositives). Unused discourse entities are 




Figure 1: S-list Ranking and Familiarity 

proper names and tides. In texts, brand-new proper 
names are usually accompanied by a relative clause 
or an appositive which relates them to the hearer's 
knowledge. The corresponding discourse entity is 
evoked only after this elaboration. Whenever these 
linguistic devices are missing, proper names are 
treated as unusec^ I restrict inferrables to the par- 
ticular subset defined by Hahn et al. (1996 ). An- 
chored brand-new discourse entities require that the 
anchor is either evoked or unused. 

I assume the following conventions for the rank- 
ing constraints on the elements of the S-list. The 
3-tuple {x,uttx,poSx) denotes a discourse entity x 
which is evoked in utterance uttx at the text posi- 
tion posx- With respect to any two discourse en- 
tities {x,uttx,poSx) and {y,utty,poSy), uttx and 
utty specifying the current utterance Ui or the pre- 
ceding utterance C/j-i, I set up the following order- 
ing constraints on elements in the S-list (Table 
For any state of the processor/hearer, the ordering 
of discourse entities in the S-list that can be derived 
from the ordering constraints (1) to (3) is denoted 
by the precedence relation -<. 



(1) If s G OLD and y G MED, then x ^ y. 
If a; G OLD and y G NEW, then x ^y. 
If a; G MED and y G NEW, then x ^y. 

(2) If y G OLD, orx,y € MED, or x,y e NEW, 
then if uttx >- utty, then x ^ y, 

if uttx = utty and posx < posy, then x ^ y. 



Table 2: Ranking Constraints on the S-list 

Summarizing Table ^ I state the following pref- 
erence ranking for discourse entities in Ui and Ui-i: 
hearer-old discourse entities in Ui, hearer-old dis- 
course entities in f/j-i, mediated discourse entities 
in Ui, mediated discourse entities in f/j-i, hearer- 
new discourse entities in Ui, hearer-new discourse 
entities in Ui-i. By making the distinction in (2) 

' For examples of brand-new proper names and their intro- 
duction cf., e.g., the "obituaries" section of the New York Times. 

^The relations >~ and = indicate that the utterance containing 
X follows {y-) the utterance containing y or that x and y are 
elements of the same utterance (=). 



between discourse entities in Ui and discourse enti- 
ties in C/j_i, I am able to deal with intra-sentential 
anaphora. There is no need for further specifications 
for complex sentences. A finer grained ordering is 
achieved by ranking discourse entities within each 
of the sets according to their text position. 

3.3 The Algorithm 

Anaphora resolution is performed with a simple 
look-up in the S-list^. The elements of the S-list are 
tested in the given order until one test succeeds. Just 
after an anaphoric expression is resolved, the S-list 
is updated. The algorithm processes a text from left 
to right (the unit of processing is the word): 

1 . If a referring expression is encountered, 

(a) if it is a pronoun, test the elements of the 
S-list in the given order until the test suc- 
ceedsQ; 

(b) update S-list; the position of the referring 
expression under consideration is deter- 
mined by the S -list-ranking criteria which 
are used as an insertion algorithm. 

2. If the analysis of utterance Jj^ is finished, re- 
move all discourse entities from the S-list, 
which are not realized in U. 

The analysis for example (|l|) is given in Table 10. 
I show only these steps which are of interest for the 
computation of the S-list and the pronoun resolu- 
tion. The preferences for pronouns (in bold font) 
are given by the S-list immediately above them. The 
pronoun "she" in (|l|b) is resolved to the first el- 
ement of the S-list. When the pronoun "her" in 
(|l]c) is encountered, FRIEDMAN is the first element 
of the S-list since FRIEDMAN is unused and in the 
current utterance. Because of binding restrictions, 
"her" cannot be resolved to Friedman but to the 
second element, Brennan. In both (jT]d) and (|I]d') 
the pronoun "she" is resolved to Friedman. 

^^The S-list consists of referring expressions which are spec- 
ified for text position, agreement, sortal information, and infor- 
mation status. Coordinated NPs are collected in a set. The S- 
list does not contain predicative NPs, pleonastic "it", and any 
elements of direct speech enclosed in double quotes. 

"'The test for pronominal anaphora involves checking agree- 
ment criteria, binding and sortal constraints. 

'l here define that an utterance is a sentence. 

*In the following Tables, discourse entities are represented 
by SmallCaps, while the corresponding surface expression 
appears on the right side of the colon. Discourse entitites are 
annotated with their information status. An "e" indicates an 
elliptical NP. 



#) 


Brennan drives an Alfa Romeo 
S: [BrennaNu: Brennan, 

Alfa ROMEOsjv: Alfa Romeo] 


(§b) 

ea 


She drives too fast. 

S: [BrennaNe: she] 


#) 


Friedman 

S: [Friedman^: Friedman, BrennaNb: she] 
races her on weekends. 

S: [FriedmaNl/: Friedman, BrennaNb: her] 


— fT 

#1) 


She drives to Laguna Seca. 
S: [FriedmaNb: she, 

Laguna SecAc/: Laguna Seca] 


(0d') 


She 

S: [FriedmaNb: she, BrennaNb: her] 
often beats her. 

S: [FriedmaNb: she, BrennaNb: her] 





Brennan drives an Alfa Romeo 
S: [BrennaNc/: Brennan, 

Alfa ROMEOsjv: Alfa Romeo] 


#) 

ea 


She drives too fast. 

S: [BrennaNb: she] 


#) 


A professional driver 

S: [BrennaNb: she, DriveRsat: Driver] 
races her on weekends. 

S: [BrennaNb: her, DriveRsjv: Driver] 


— d — 

w 


She drives to Laguna Seca. 
S: [BrennaNb: she, 

Laguna SecAu: Laguna Seca] 


dd') 


She 

S: [BrennaNb: she, DriveRsat: Driver] 
often beats her. 

S: [BrennaNb: she, DriveRb: her] 



Table 3: Analysis for ([T]) 



Table 4: Analysis for i 



The difference between my algorithm and the 
BFP-algorithm becomes clearer when the unused 
discourse entity "Friedman " is replaced by a brand- 
new discourse entity, e.g., "a professional driver'^ 
(cf. example (^). In the BFP-algorithm, the rank- 
ing of the C/-list depends on grammatical roles. 
Hence, Driver is ranked higher than Brennan in 
the C/(|c). In (gd), the pronoun "she" is resolved 
to Brennan because of the preference for con- 
tinue over RETAIN. In (||d'), "she" is resolved to 
Driver because smooth-shift is preferred over 
ROUGH-SHIFT. In my algorithm, at the end of (|c) 
the evoked phrase "her" is ranked higher than the 
brand-new phrase "a professional driver" (cf. Ta- 
ble^. In both (||d) and (^') the pronoun "she" is 
resolved to Brennan. 

(2) a. Brennan drives an Alfa Romeo. 

b. She drives too fast. 

c. A professional driver races her on weekends. 

d. She goes to Laguna Seca. 
d.' She often beats her. 

Example (^)[| illustrates how the preferences for 
intra- and inter-sentential anaphora interact with the 
information status of discourse entitites (Table 
Sentence (|3|a) starts a new discourse segment. The 
phrase "a judge " is brand-new. "Mr Curtis " is 
mentioned several times before in the text, Hence, 

'l owe this variant Andrew Kehler. - This example can mis- 
direct readers because the phrase "a professional driver" is as- 
signed the "default" gender masculine. Anyway, this example 
- like the original example - seems not to be felicitous English 
and has only illustrative character. 

'^In: The New York Times. Dec. 7, 1997, p.A48 ("Shot in 
head, suspect goes free, then to college"). 



the discourse entity CURTiS is evoked and ranked 
higher than the discourse entity JUDGE. In the 
next step, the ellipsis refers to JUDGE which is 
evoked then. The nouns "request" and "prosecu- 
tors" are brand-newf^. The pronoun "he" and the 
possessive pronoun "his" are resolved to CURTIS. 
"Condition" is brand-new but anchored by the pos- 
sessive pronoun. For (|3|b) and (^) I show only 
the steps immediately before the pronouns are re- 
solved. In (^) both "Mr Curtis" and "the judge" 
are evoked. However, "Mr Curtis" is the left-most 
evoked phrase in this sentence and therefore the 
most preferred antecedent for the pronoun "him". 
For my experiments I restricted the length of the 
S-list to five elements. Therefore "prosecutors" in 
(||b) is not contained in the S-list. The discourse 
entity Smirga is introduced in (^). It becomes 
evoked after the appositive. Hence Smirga is the 
most preferred antecedent for the pronoun "he". 

(3) a. A judge ordered that Mr. Curtis be released, but 
e agreed with a request from prosecutors that he 
be re-examined each year to see if his condition 
has improved. 

b. But authorities lost contact with Mr. Curtis after 
the Connecticut Supreme Court ruled in 1990 
that the judge had erred, and that prosecutors 
had no right to re-examine him. 

c. John Smirga, the assistant state's attorney in 
charge of the original case, said last week that 
he always had doubts about the psychiatric re- 
ports that said Mr. Curtis would never improve. 



''l restrict inferrables to the cases specified by Hahn et al. 
( 1 996 ). Therefore "prosecutors" is brand-new (cf. P rince 
(1992) for a discussion of the form of inferrables). 



A judge 

S: [JudgEsat: judge] 
ordered that Mr. Curtis 

S: [CurtiSb: Mr. Curtis, JudgEbjv: judge] 
be released, but e 

S: [CurtiSb: Mr. Curtis, JudgEb: e] 
agreed with a request 

S: [CurtiSb: Mr. Curtis, JudgEb: e, RequesTsat: request] 
from prosecutors 

S: [CurtiSe: Mr. Curtis, JudgEb: e, REQUESTsiv: request, ProsecutorSbjv: prosecutors] 
that he 

S: [CurtiSb: he, JudgEb: e, RequesTbat: request, ProsecutorSsat: prosecutors] 
be re-examined each year 

S: [CurtiSb: he, JudgEb: e, RequesTbat: request, ProsecutorSbat: prosecutors, YeaRbjv: year] 
to see if his 

S: [Curtis^: his. Judges: e, RequesTbat: request, ProsecutorSsjv: prosecutors, YeaRbat: year] 
condition 

S: [CurtiSe: his, JudgEb: e, Condition^^a : condition, RequesTbjv: request, ProsecutorSbjv: prosec] 
has improved. 

S: [CurtiSb: his, JudgEb: e, ConditioNbat^ : condition, RequesTsat: request, ProsecutorSsat: prosec] 



But authorities lost contact with Mr. Curtis after the Connecticut Supreme Court ruled in 1990 that the judge had 
erred, and that prosecutors had no right 

S: [CurtiSb: his, CS CoURTy: CS Court, JUDGEs: judge, Condition^^a : condition, Auth.bat: auth.] 
to re-examine him. 

S: [CurtiSb: him, CS COURT [/: CS Court, JudgEb: judge, CONDITIONgAr^: condition, Auth.bat: auth.] 



John Smirga, the assistant state's attorney in charge of the original case, said last week 

S: [SmirgAb: attorney, CasEb: case, CURTISs: him, CS COURT (j: CS Court, JudgEe: judge ] 

that he had doubts about the psychiatric reports that said Mr. Curtis would never improve. 

S: [SmirgAb: he, CasEb: case, ReportSb: reports, CurtiSb: Mr. Curtis, DoubtSba?: doubts] 



Table 5: Analysis for < 



4 Some Empirical Data 

In the first experiment, I compare my algorithm with 
the BFP-algorithm which was in a second experi- 
ment extended by the constraints for complex sen- 
tences as described by Kameyama (1998| ). 

Method. I use the following guidelines for the 



hand-simulated analysis ( [Walker, 1989[ ). I do not as- 
sume any world knowledge as part of the anaphora 
resolution process. Only agreement criteria, bind- 
ing and sortal constraints are applied. I do not ac- 
count for false positives and error chains. Following 



Walker (19891 ), a segment is defined as a paragraph 
unless its first sentence has a pronoun in subject po- 
sition or a pronoun where none of the preceding 
sentence-internal noun phrases matches its syntactic 
features. At the beginning of a segment, anaphora 
resolution is preferentially performed within the 
same utterance. My algorithm starts with an empty 
S-list at the beginning of a segment. 

The basic unit for which the centering data struc- 
tures are generated is the utterance U. For the BFP- 
algorithm, I define U as a. simple sentence, a com- 
plex sent ence, or eac h full cl ause of a compound 
sentence. Kameyama 's ( 1998 ) intra-sentential cen- 
tering operates at the clause level. While tensed 



clauses are defined as utterances on their own, un- 
tensed clauses are processed with the main clause, 
so that the C/-list of the main clause contains 
the elements of the untensed embedded clause. 



Kameyama distinguishes for tensed clauses further 
between sequential and hierarchical centering. Ex- 
cept for reported speech (embedded and inaccessi- 
ble to the superordinate level), non-report comple- 
ments, and relative clauses (both embedded but ac- 
cessible to the superordinate level; less salient than 
the higher levels), all other types of tensed clauses 
build a chain of utterances on the same level. 

According to the preference for inter-sentential 
candidates in the centering model, I define the fol- 
lowing anaphora resolution strategy for the BFP- 
algorithm: (1) Test elements of Ui-i. (2) Test el- 
ements of Ui left-to-right. (3) Test elements of 
Cf{Ui.2), C/(C/i_3), - In my algorithm steps (1) 
and (2) fall together. (3) is performed using previ- 
ous states of the system. 

Results. The test set consisted of the beginnings 
of three short stories by Hemingway (2785 words, 
153 sentences) and three articles from the New 
York Times (4546 words, 233 sentences). The re- 
sults of my experiments are given in Table 0. The 



first row gives tlie number of personal and posses- 
sive pronouns. The remainder of tlie Table shows 
the results for the BFP-algorithm, for the BFP- 



algorithm extended by Kameyama's intra-sentential 



specifications, and for my algorithm. The overall 
error rate of each approach is given in the rows 
marked with wrong. The rows marked with wrong 
(strat.) give the numbers of errors directly produced 
by the algorithms' strategy, the rows marked with 
wrong (ambig.) the number of analyses with am- 
biguities generated by the BFP-algorithm (my ap- 
proach does not generate ambiguities). The rows 
marked with wrong (intra) give the number of er- 
rors caused by (missing) specifications for intra- 
sentential anaphora. Since my algorithm integrates 
the specifications for intra-sentential anaphora, I 
count these errors as strategic errors. The rows 
marked with wrong (chain) give the numbers of er- 
rors contained in error chains. The rows marked 
with wrong (other) give the numbers of the remain- 
ing errors (consisting of pronouns with split an- 
tecedents, errors because of segment boundaries, 
and missing specifications for event anaphora). 





Hem. 


NYT 


S 


Pron. and Poss. Pron. 


274 


302 


576 




Correct 


189 


231 


420 




Wrong 


85 


71 


156 




Wrong (strat.) 


14 


2 


16 


BFP-Algo. 


Wrong (ambig.) 


9 


15 


24 




Wrong (intra) 


17 


13 


30 




Wrong (cliain) 


29 


32 


61 




Wrong (other) 


16 


9 


25 




Correct 


193 


245 


438 




Wrong 


81 


57 


138 




Wrong (strat.) 


3 





3 


BFP/Kam. 


Wrong (ambig.) 


17 


8 


25 




Wrong (intra) 


17 


27 


44 




Wrong (chain) 


29 


15 


44 




Wrong (other) 


15 


7 


22 




Correct 


217 


275 


492 




Wrong 


57 


27 


84 


My Algo. 


Wrong (strat.) 


21 


12 


33 




Wrong (chain) 


22 


9 


31 




Wrong (other) 


14 


6 


20 



Table 6: Evaluation Results 

Interpretation. The results of my experiments 
showed not only that my algorithm performed bet- 
ter than the centering approaches but also revealed 
insight in the interaction between inter- and intra- 
sentential preferences for anaphoric antecedents. 
Kameyarn^ 's specifications reduce the complexity 



BFP-algorithm combined with her specifications 
has almost no strategic errors while the number of 
ambiguities remains constant. But this benefit is 
achieved at the expense of more errors caused by the 
intra-sentential specifications. These errors occur in 
cases like example (|3|), in which Kameyama's intra- 
sentential strategy makes the correct antecedent less 
salient, indicating that a clause-based approach is 
too fine-grained and that the hierarchical syntactical 



in that the Cy-lists in general are shorter after split- 
ting up a sentence into clauses. Therefore, the 



structure as assumed by |Kameyama| does not have a 
great impact on anaphora resolution. 

I noted, too, that the BFP-algorithm can gener- 
ate ambiguous readings for Ui when the pronoun 
in Ui does not co-specify the Cb{Ui-i). In cases, 
where the Cf{Ui-i) contains more than one possi- 
ble antecedent for the pronoun, several ambiguous 
readings with the same transitions are generated. 
An example^]: There is no C&(^) because no ele- 
ment of the preceding utterance is realized in (^). 
The pronoun "f/zem" in (^) co-specifies "deer" hut 
the BFP-algorithm generates two readings both of 
which are marked by a RETAIN transition. 

(4) a. Jim pulled the burlap sacks off the deer 
b. and Liz looked at them. 

In general, the strength of the centering model is 
that it is possible to use the C6(C/j_i) as the most 
preferred antecedent for a pronoun in Ui. In my 
model this effect is achieved by the preference for 
hearer-old discourse entities. Whenever this prefer- 
ence is misleading both approaches give wrong re- 
sults. Since the Cb is defined strictly local while 
hearer-old discourse entities are defined global, my 
model produces less errors. In my model the pref- 
erence is available immediately while the BFP- 
algorithm can use its preference not before the sec- 
ond utterance has been processed. The more global 
definition of hearer-old discourse entities leads also 
to shorter error chains. - However, the test set is 
too small to draw final conclusions, but at least for 
the texts analyzed the preference for hearer-old dis- 
course entities is more appropriate than the prefer- 
ence given by the BFP- algorithm. 

5 Comparison to Related Approaches 

Kameyam|'s ( [T9981 ) version of centering also omits 
the centering transitions. But she uses the Cb and 
a ranking over simplified transitions preventing the 
incremental application of her model. 

'"in: Ernest Hemingway. Up in Michigan. In. The Com- 
plete Short Stories of Ernest Hemingway. New York: Charles 
Scribner's Sons, 1987, p.60. 



T he focus model (|Sidner, 1983|; Suri & McCoy, 
1994) accounts for evoked discourse entities explic- 
itly because it uses the discourse focus, which is de- 
termined by a successful anaphora resolution. In- 
cremental processing is not a topic of these papers. 

Even models which use salience measures for de- 
termining the antecedents of pronoun use the con- 
cept of evoked discourse entities. H ajicova et al. 
(1992) assign the highest value to an evoked dis- 
course entity. Also ^^.appin & Leass (1994 ), who 
give the subject of the current sentence the high- 
est weight, have an implicit notion of evokedness. 
The salience weight degrades from one sentence to 
another by a factor of two which implies that a re- 
peatedly mentioned discourse entity gets a higher 
weight than a brand-new subject. 

6 Conclusions 

In this paper, I proposed a model for determining 
the hearer's attentional state which is based on the 
distinction between hearer-old and hearer-new dis- 
course entities. I showed that my model, though 
it omits the backward-looking center and the cen- 
tering transitions, does not lose any of the predic- 
tive power of the centering model with respect to 
anaphora resolution. In contrast to the centering 
model, my model includes a treatment for intra- 
sentential anaphora and is sufficiently well specified 
to be applied to real texts. Its incremental character 
seems to be an answer to the question Kehler (1997 ) 
recently raised. Furthermore, it neither has the prob- 
lem of inconsistency Kehlei mentioned with respect 
to the BFP-algorithm nor does it generate unneces- 
sary ambiguities. 

Future work will address whether the text posi- 
tion, which is the weakest grammatical concept, is 
sufficient for the order of the elements of the S-list 
at the second layer of my ranking constraints. I will 
also try to extend my model for the analysis of def- 
inite noun phrases for which it is necessary to inte- 
grate it into a more global model of discourse pro- 
cessing. 
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